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Abstract. In 1888 Hilbert showed that every nonnegative homogeneous polynomial F with 
real coefficients is a sum of squares of polynomials if and only if F is a quadratic form, a 
binary form or a ternary quartic. In these cases, the study of these decompositions brings 
to new representations for nonnegative polynomials. Starting from the Caratheodory theorem 
we calculate the so-called Caratheodory number of the Hilbert cones of quadratic forms and 
binary forms. 

1. Introduction 

The study of the decomposability of a given nonnegative polynomial as a sum of squares is an 
old problem in mathematics: in 1888 Hilbert showed that every nonnegative homogeneous poly- 
nomial F e R[xi, ...,x n ] of even degree d is a sum of squares of other polynomials if and only 
if n = 2 (binary forms), d — 2 (quadratic forms) or (n, d) = (3,4) (quartics in three variables). 
In the 20th century were producted various examples of nonnegative polynomials that can't be 
written as a sum of squares (see the works by Motzkin, Robinson, Choi and Lam on ternary 
sextics and quaternary quartics). 

We start from Hilbert's classical result, studying some aspect of particular decompositions. Ob- 
serve that the set of nonnegative polynomials is a convex cone in a suitable real vector space. So it 
is interesting to study its extreme points, that are points that cannot be non-trivially decomposed 
as sum of other elements of the set. Extreme points are the fundamental elements of the cone: in 
fact, by Caratheodory Theorem (see Theorem 12. 2[) , if the dimension of the vector space is finite 
and under some suitable conditions, every element of the cone can be written as a finite sum of 
extreme points. Furthermore, given x in the cone, if we call the height of x the minimum integer t 
such that x is a sum of t extreme points, from Caratheodory theorem one obtains that this value 
is globally bounded by the dimension of the cone itself. Since every element of the cone P„ i( j of 
nonnegative polynomials is a finite sum of extreme points, it is interesting to calculate the height 
of a generic F £ P n .d and also to calculate the maximum height of a nonnegative polynomial: 
this value is the so-called Caratheodory number of P n .d- 

Calculating the Caratheodory number C(X) for a given convex set X is a general problem which 
finds applications in many fields of mathematics. Although Caratheodory theorem is a classical 
result in convex analysis, nowadays there exist various open problems concerning the number C, 
both for convex sets and for compact convex sets. This problem is also naturally related with 
operational research problems; for compact convex sets like orbitopes (see [SSSJ ) this number is 
known only in some particular cases. Independently from its formal definition, it is possible to 
exploit various results about Caratheodory number in numerical methods for the resolution of 
optimization problems (for an application, see jGLSj ). For example, if you want to represent, 
with the method of least squares, an element a; of a given cone C such that x = X)i=i e * with 
respect to the class {&;} of extreme points of C, it is clear that the degrees of freedom of this 
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representation increse as t increses. So it is interesting to find the optimal (i.e. the minimum) t 
such that this representation is possible for every x in the cone. 

The main result about binary forms, which is the most interesting of this paper, says that the 
Caratheodory number of P2,d is 2 for every even degree d: so every nonnegative binary form is 
spanned by a pair of extreme points. This implies Corollary 13. 131 about extremal representations 
of these polynomials. 

For this paper we used some results and considerations mainly due to Bruce Rcznick ( |Rel| and 
|Re2| ) about Hilbcrt cones and representations of nonnegative polynomials, while other interest- 
ing results about the structure of P n ,d can be found in [B] and |BH| . Finally, we would like 
to recall the work |KSJ by Karlin and Shapley, where they obtained a similar result studying 
the connection between the structure of the cone of polynomials that are nonnegative over the 
interval [0, 1] and the moment problem: but this is neither necessary nor sufficient for our results. 

This paper is based on the results of my master thesis. I would like to thank Giorgio Ottaviani 
and Marco Longinetti for their fundamental aid and their constant encouragement to improve 
this work. 

2. Preliminaries 

Let K be the field of real numbers, and let H be an K-vector space of finite dimension. 

Definition 2.1. A subset C C H is a cone if for each a E C and each a > then aa E C. 

The dimension of a cone is the dimension of its affinc hull. We will be interested in closed and 
convex cones of finite dimension. It is easy to see that a cone C is convex if and only if, given 
a,beC, a + beC. 

Definition 2.2. Let C be a closed cone. An element e E C is an extreme point for C if every 
decomposition e = /i + fit with /, E C, implies fi = c^e, for some Ci > 0. 

So, a point a; in a cone is extreme if and only if it cannot be non-trivially decomposed as the sum 
of two other elements of the cone. We denote by Ext(C) the set of extreme points of C. It is 
clear that [i ■ Ext(C) = Ext(C) for every \i E R + . Moreover, in a closed convex cone the set of 
extreme points is a subset of the topological boundary of the cone. We will refer to the following 
two classical facts: the first is a version for convex cones over a compact set of the Krein-Milman 
theorem, for the second you can see an easy proof on [Relj . 

Theorem 2.1 (Krein-Milman). A closed convex cone that doesn't contain lines is the convex hull 
of its extreme points. 

Theorem 2.2 (Caratheodory). Let C be a convex cone of dimension N containing 0, and z = 
x j w ith Xj E C. Then there exist {y\, . . . ,i/n} Q C such that z — 5^ i=1 JJi and that Vi 3j 
s.t. yi = £jXj, for some 6j < 1. 

So one obtains 

Corollary 2.3. Let C be a closed convex cone of dimension N that doesn't contain lines. Then 
every x E C has a representation as sum of at most N extreme points of C . 

Proof. Let x E C . Then, by Thcorcm l2.ll x can be written as sum of a finite number of extreme 
points of C, and by Theorem 12.21 this number can be reduced to N . □ 
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We will call extremal a decomposition of an element x G C of the type x = ^ e, for e, G Ext(C). 
By Corollary 12.31 we can give the following definition. 

Definition 2.3. Let C be a closed convex cone of finite dimension that doesn't contain lines, 
and let x G C . We call the height of x the number h(x) = min{r G N x = Yli=i e i f° r some G 
Ext(C)} and the Caratheodory number of the cone C the number 

C(C) = max {h(a:)}. 

Remark 2.1. Notice that h(x) < dim(C) for every x G C, that is C(C) < dim(C). 

3. The Caratheodory number for Hilbert cones 

We will denote by F„ i£ ; the vector space R[a;i, x n ]d of homogeneous polynomials of degree d in 
n variables, which is the d— th homogeneous component of the gradued ring R[iei, x n ], by 

(3.1) P n , d = {fe F n , d | P(X) >0,VI = (n,.. . ,xn) g R" j 

the set of nonnegative polynomials in n variables of degree d, and by 

t 

(3-2) E„, d = {]>> 2 | fli eF„ fl ( GN} 

»=i 

the set of sums of squares. Of course one has the trivial inclusion S n< d C P„.d for each (n, d). In 
Hilbert characterized the cases when this inclusion is an equality. 



Theorem 3.1 (Hilbert, 1888). P n .d = ^n.d if and only if one of the following cases occurs: 

(a) n — 2 (binary forms) 

(b) d = 2 (quadratic forms) 

(c) (n,d) = (3, 4) (ternary quartics) 

If we denote by 

(3.3) {(n,2),(2,d),(3,4) n G N, d G 2n|, 

Hilbert's theorem says that P ni d = if and only if (n, (i) G P n ,d and S„ jC j are the so-called 
Hilbert cones of polynomials. In fact, it is easy to show the following proposition (for a proof you 
can see Proposition 3.6 of [Relj V 

Proposition 3.2. P n> <j and S n> d are full- dimensional closed convex cones o/F„ )( j- 

For F G F„ )( j denote by Vr(P) the set of real points of the variety {F = 0}. 

Proposition 3.3. If F G 9P n ,d i/iera every pomi o/ Vr(F) C R™ is singular. 

Proof. It is clear that the interior of V n ,d consists of polynomial that are positive definite over 
R™, that is, for every F in the interior, F(X) > for every X G R". Let F G <9P„, d . If a G K" 
such that F(a) = 0, then by nonnegativity VF(a) = 0, that is a is a singular point for Vr(-F). □ 

Proposition 3.4. If(n,d) G * then Ext(P„, d ) C F^™ = jp 2 | P e F n A. 

Proof. In fact if (n, d) G ^, then P n ,d = ^n,d- So, let F G Ext(P ra> d), then P = Y^k=\9k where 
w is the mimimum integer for this type of decomposition. So gi ^ agj for j 7^ j for every a, 
because otherwise we could write 

tv—l 

(3-4) F= + + = 
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and this is a contradiction. Since F is extreme, the only possibility is that w = 1, so that F is a 
square. □ 



So, if (n,d) e *, then Ext(P n , d ) C F 2 ,n dP n4 . Now, we are interested in characterizing 
the subset of extreme points of P n ,d when (n,d) £ 'J/, and in calculating in some cases the 
Caratheodory number of the cone P n ,d- Remember that, by Remark 12.11 we obtain the upper 
bound 

(3-5) C(P n , d )< ^ + d ~ 

3.1. Apolarity and the Veronese orbitope 
Let's consider (for every even degree d) the set of polynomials 



a k G R n ,r e N 



Qn,d ={F = J2(<Xk- Xf 

k=l 

(where a k ■ X — a\X\ + • ■ • + a n x n ) of finite sums of d— th powers of linear forms, which is a 
closed convex subcone of E ni <j. Let P n denote the n— dimensional projective space over the real 
field. Since every element of Q n ,d is a sum of d—th powers, it is clear that the subset of extreme 
points of Q n .d consists of all d—th powers of linear real forms: this set is related to the image of 
the Veronese map ^ l _i jm (P™~ 1 ), where we have defined 

(3.6) Y k4 : ¥ k -> pC^")" 1 S P(S d (M fc )), 

with 'fk,d([xo ■ ■■■ ■ Xk]) = [••• ■ X 1 : • • •], where I ranges over the set I(k + l,d) = {I = 
(io, . . . , ik) | ij & N and ^2 ij = d}. This image parametrizes the variety of d—th powers of linear 
forms, subvariety of the space p( d ) _1 of hypersurfaces of P fe of degree d: so the Zariski closure 
Ext(Q njC j) coincides with the Veronese variety in the projective space Pv d > We obtain that 
the cone Q n> d is the convex hull of the Veronese variety, that is the so-called Veronese orbitope 
(see [SSS] V 

Now, let M.[d Xl , . . . , d Xn ]d = F' n d be the dual ring of F n ,d. We have that F ni <j is generated by 
the set of monomials {X I }j £ x(n,d) where X 1 = x 1 ^ ■ ■ ■ x % ™ while F^ d is generated by the set of 
monomials of derivatives {^j}jez(n,d) where di = rp - !^ ' " ■ ® % x n and c(J) = rp~! ■ Consider the 
natural map a: F n4 — >• d such that cr^X 1 ) = ^v. This map produces the following bilinear 
form on F„ iC j: the map r: F n4 X F„ i( j — > K such that t(F,G) = (o~(G)) (F). Using coordinates, 
if F = J2c(I)a F (I)X I and G = Y,c{I)a G (I)X I we obtain 

t(F, G)=J2 c(I)a F (I)a G (I) = r(G, F). 

Under this product the cones P n ,d and Q n ,d are mutually dual, as shown in jRelj . Now, fixed 
F £ F n4 and 1 < i < d — 1, consider the i-th apolarity map 

(3.7) AP F (i,d-i;n):F n , d - i -+F n ,i 

sending G n> (a(G)) (F). AP F (i 7 d — i;n) is linear: the matrix of the map AP^(|, |;n) is the 
catalecticant (matrix) of F, and it is denoted by Hp-. If F is a quadratic form of matrix Mj? , 
then Hi?=Mi?;ifFisa binary form of degree d, Hi? is a Hankel matrix of order | + 1 whose 
terms on the diagonals are the coefficients of F. The catalecticant of F € P n ,d is strictly linked 
to the decomposability of F as a sum of squares or powers of linear forms. For example, S* d 
(the convex cone dual to is the set of nonnegative polynomials whose catalecticant matrix 
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is positive semidefinite; so, if (n, d) G ^, Q n .d = E* d has this property. Rcznick shows the 
following interesting fact f |Relj . Theorem 4.6): 

Theorem 3.5 (Rcznick). For every F G Qn,d let w(F) be its height in Q„ j( i. Then, for every F , 
w(F) > rk(H^). The equality holds for every F G Q n ,d if and only if (n, d) £ 

This result, considered in the context of Carathcodory number means that C(Q n> d) is always 
higher than the maximum rank of the catalccticant matrix of a generic sum of d— th powers of 
linear forms on n variables. 

Proposition 3.6. We obtain the following results about C(Q„.d). 



(n,d) 


C(Qii,d) 


(n,2) 


C = n 


(2,d) 


C = f + 1 


(3,4) 


C = 6 




r:^ 1 ) < c < r^r 1 ) 





3.2. Quadratic forms 

Let's analyze the first case of the Hilbert's theorem, that is P n ,2 = {F G F n> 2 | F(X) >0VXe 
M n }, n G N. There is a natural bijection between the cone P„,2 of nonnegative real quadratic 
forms on n variables and the cone of symmetric positive semidefinite matrices {M^}, n x n with 
real coefficients; in fact a nonnegative quadratic form F G P n .2 has a representation of the form 
F(X) = t MpXMp, for every X G R n . Diagonalizing the matrix one obtains a diagonal form 
for F, that is 

rkMjr 

(3.8) F(X)= ^-^) 2 ; 

i=l 

that is a sum of rkMi? squares of linear forms. 
Proposition 3.7. Ext(P„, 2 ) = F£,i = <9P„ j2 , V n G N. 

Proof. We already know that Ext(P n< 2) C -l by Proposition 13.41 and 1 = 9P ni 2 because 
every polynomial of the boundary has at least one double real root: so the bound on the degree 
necessarily forces the equality. Let now F G F^ x . Then F = (a ■ X) 2 = (aiXi + ■ ■ ■ + a n x n ) 2 
where a = {a\ . . . a n ) G W l . The (real) zeros of F compose the set of points of K" orthogonal to 
a, that is the hyperplane a 1 - . If F = Fi + F 2 with F t G P n ,2, w e obtain a x = V(-Fi) H V^), 
that is V(Fi) = V{F2) = a 1 - because V(Fi) are hyperplanes too. Since deg-Fi = degi<2 = 2, 
there exist s.t. Fi = \{F and so F G Ext(P nj 2). □ 

Now, since every square of a linear form is an extreme point of the cone P n ,2, the decomposition 
(|3.8[) is extremal and it is the minimal decomposition of F as a sum of squares of linear forms. 
That is, h(F) = rkMp for every F G P n ,2 and so we conclude that the Caratheodory number 
C(P„.2) is the maximum rank of a nonnegative quadratic form on n variables, that is 

Theorem 3.8. C(P„ : 2) = n, for every n G N. 

We remark that there is no possibility of uniqueness of the extremal representations in P n< 2- In 
fact: 
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Example 3.1. Take G n (X) > 1, here represented with respect to the 

orthonormal canonical base of W 1 . If we impose 

G n (xi, . . . , x n ) = x\ H \-x? = ^Qffcixi + ... + afc„£ n J = 

fe=i 

= + ■■■ + a ln x n + 2 X] °<kiOtkjXiXj 

k—1 i<j 

we obtain n+ (™) = " +" quadratic conditions on the matrix M = (oiij), that force M G SO(n, M). 
So the family of extremal representations of G n is parametrized by the group SO(n,R). 

3.3. Binary forms 

A polynomial F G P 2 ,d has the form 

d 

F(x,y) '. 

3=0 

for some real numbers Cj such that it is nonnegative over R". By the Fundamental Theorem of 
Algebra, over the complex held F splits into a product of linear forms. By nonnegativity, the real 
roots of F have even multiplicity, and so 

(3.9) F(x, y) = Y[ (atx - b t y) 2kt Y\_ t"" 1 _ b a y)(a a x - b a y) 

t a 

where the hrst product ranges over the real roots of F (and the multiplicity of the t-th root is 2fc t ), 
while the second one ranges over the set of couples of the complex roots with their conjugates. 
So F is the product of a square times a product of two complex conjugated polynomials, that 
is F = R 2 (d + iC 2 )(Ci - iC 2 ) = (RCx) 2 + (RC 2 ) 2 . Then we can see first of all that every 
nonnegative binary form is always a sum of at most two squares. 

Theorem 3.9. Let F G P2,d- Then F is extreme for the cone V 2 ,d if only if F is the square 
of a polynomial with only real roots. 

Proof. First suppose that F = G 2 has only real roots and F = F\ + F 2 with Fi G P 2 ,d- One 
has V(F) = V(F 1 ) n V{F 2 ) by nonnegativity, so V(F) C V(Fi) for i = 1,2. Moreover, every 
linear form of the factorization of F must appear also in the factorization of the F^s. In fact, 
let L(x,y) = ax — by be a linear form that divides F; then = F(b,a) = F\{b,a) + F 2 (b,a) 
and by the nonnegativity (6, a) nullifies both Fj's, so L divides them. Now, if F = L 2 H, L 
divides Fi, and so does L 2 , that is Fi = L 2 Ji. Proceeding by induction, it is easy to see that if 
F = L 2t H then L 2t divides the F^s. Iterating this process for each linear form of the factorization 
of F, remembering that F,Fi,F 2 have degree d and that F has only real roots, we obtain that 
V(F) = V{Fi) = V(F 2 ) = L 2tl U • • • U Lf k and that F, F U F 2 define the same hypersurface. So 
there exist Ai, A2 G R, nonnegative, such that Ai + A2 = 1 and Fi = XiF, from which F must 
be extreme. Vicevcrsa, let's suppose that F is extreme in P 2 .<j- So, there exists G G F2, s such 
that F = G 2 , as we observed. Let's suppose that G has a complex root a. Observe that, after a 
suitable translation F(x(ip, ip),y(tp, = F(tp,ip), we can suppose that this root is imaginary. 
So 

2t / \ 2t / \2t 



F(cp,ip) = f(p -ia2ifj (p + ia2ip) ■ H 2 {f, VO = (V + {a 2 ip) 2 ^ 




■H 2 
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and so 

F(x, y) = F[<p(x, y), if>(x, !/))=£ ( ■ ) (( x ~ W) 2 * - ' ' H ) 

and for each i = 0,...,2t the z-th summand is not a multiple of F. So we found a non-trivial 
decomposition of F as a sum of other elements of the cone. We have a contradiction because F 
is extreme by assumption. □ 

So, for binary forms there exist perfect squares that are not extreme points of P 2 ,d'- for example, 
the polynomials Jd = (x* + yi) 2 , d even, belong to F 2 d \ Ext(P2 a) for every d > 4 even. 

■2 

Definition 3.1. Let be F g P 2 ,d- We will call a partition of the roots of F, a couple (A, A) € 
F 2 d x F 2 d of conjugate polynomials such that 

(1) F = A-A 

(2) Given a root of F, then a is a root of A if and only if a is a root of A. 

So, if A(x,y) = G{x,y) + i • H(x,y) then y) = G(x,y) — « ■ H(x,y) and we obtain the 
decomposition 

(3.10) F = G 2 + H 2 . 

We can see that these are the only representations for F as a sum of two squares. 

Theorem 3.10. Let F be a binary form of even degree d and G, H polynomials with real coeffi- 
cients such that F = G 2 + H 2 . Then there exists a partition (A, A) of the roots of F such that 
w.l.o.g. G is the real part and H is the imaginary part of A. 

Proof. Let F be as in the hypothesis. So F = (G + iH)(G — iH) and the polynomials G + iH e 
G — iH have degree i . In the complex field the two polynomials are product of | linear forms, 
and so all their roots are root of F. If G(a) + i ■ H(a) = then G(a) — i ■ G(a) = 0. So, calling 
A = G + iH, we conclude. □ 

Lemma 3.11. Let be s > 2 and A = f]i=i ( x - a(i) v) e Cfol/], with Im(a (i) )Im(a (j) ) > for 
every i,j. If A — G(x, y) + iH(x, y) with G, H £ M\x, y], then G and H have only real roots. 

Proof. For every complex number 7 let be 71 = Re7 and 72 = 11117. We proceed by induction on 
s. If s = 2 



A 



(x-ay)(x-0y) = {x 2 -(ai + /3i)a:y+(ai/3i -o^ft^y 2 ) +i(-(a 2 + fi2)xy+{aif3 2 + ct2Pi)y 2 ^j- 

It is easy to see that the imaginary part of A has only real roots, while the real part has only real 
roots if and only if its discriminant 

(3.11) A = (ai - Pij + 4a 2 /?2 

is nonnegative. The hypothesis about the sign of a 2 ■ /?2 concludes the first part. Suppose now 
s > 3. If A = G + iH, then A x = G x + iH x and A y = G y + iH y have degree s — 1; moreover, by 
the Gauss-Lucas theorem the roots of A x and A y lie in the convex hull of the set of roots of A, 
and so also the imaginary parts of the roots of A x , A y have the same sign. The same applies for 
any substitution A(t) = A(at + c,bt + d) (because SL(2, R) acts both on the upper half plane and 
on the lower half-plane), and so, for every R 2 3 (a, b) ^ (0,0), the imaginary parts of the roots 
of aA x + bA y = j^A{t) have the same sign. By induction, since 



(3.12) aA x + bA y = [aG x + bG y \ + i[aH x + bH y , 



the polynomials aG x + bG y and aH x + bH y have only real roots for every R 2 9 (a, b) ^ (0, 0). So, 
by Theorem 1 of |CRj we conclude that G and H have only real roots. □ 

We observe that the statement of |CR] we use, is closely related to Obreschkoff theorem (see for 
example |D]). 

Theorem 3.12. For every even degree d, C{T > 2,d) = 2. 

Proof. Let F e P2,d and let (A, A) be the partition of the roots of F obtained by choosing in A 
the roots with positive imaginary part (or negative, that is the same). Then, if A = G + iH, G 
and H have only real roots and F = G 2 + H 2 is an extremal decomposition for F. □ 

Recalling that every representation of a nonnegativc binary form as a sum of two squares comes 
from a partition of the roots, and that the imaginary part of the polynomial A of any partition 
is a multiple of y, we obtain the following final result about representations of binary forms: 

Corollary 3.13. Let d £ N be even and let F — F(x, y) be a nonnegative binary form of degree 
d whose term on x d has coefficient 1. Then there exist two binary forms L,M of degree | and 
f — 1 respectively, with only real roots and such that F = L? + y 2 M 2 . 
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